Perceptual evaluation of blind source separation for robust speech recognition
نویسندگان
چکیده
In a previous article, an evaluation of several objective quality measures as predictors of recognition rate after application of a blind source separation algorithm was reported. In this work, the experiments were repeated using some new measures, based on the perceptual evaluation of speech quality (PESQ), which is part of the ITU P862 standard for evaluation of communication systems. The raw PESQ and a nonlinearly transformed PESQ were evaluated, together with several composite measures. The results show that the PESQ-based measures outperformed all the measures reported in the previous work. Based on these results, we recommend the use of PESQ-based measures to evaluate blind source separation algorithms for automatic speech recognition.
منابع مشابه
Real-Time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition
This demo presents a real-time prototype for automatic blind source extraction and speech recognition in presence of multiple interfering noise sources. Binaural recorded mixtures are processed by a combined Blind/Semi-Blind Source Separation algorithm in order to obtain an estimation of the target signal. The recovered target signal is segmented and used as input to a real-time automatic speec...
متن کاملEvaluation of missing data techniques for in-car automatic speech recognition
One of the major concerns in deploying speech recognition applications is the lack of robustness of the technology. One key aspect is the sensitivity to stationary or non-stationary background noise. Many approaches to noise robust speech recognition have been proposed before. Some modify the front-end signal processing of the recogniser while others work on the back-end, i.e. modelling and dec...
متن کاملA spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments
A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial pro...
متن کاملA Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments
An acoustic front-end for robust automatic speech recognition in noisy and reverberant environments is proposed in this contribution. It comprises a blind source separation-based signal extraction scheme and only requires two microphone signals. The proposed front-end and its integration into the recognition system is analyzed and evaluated in noisy living room-like environments according to th...
متن کاملSpatio-temporal Speech Enhancement for Robust Speech Recognition
A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Signal Processing
دوره 88 شماره
صفحات -
تاریخ انتشار 2008